Search Result

Journals

Publication Years

Keywords

Please wait a minute...

For Selected:

Download Citations
EndNote Ris BibTeX

Toggle Thumbnails

Select

Q-table initialization approach for safe exploration based on factorization machine

Bosen ZENG, Yong ZHONG, Xianhua NIU

Journal of Computer Applications 2022, 42 (1): 209-214. DOI: 10.11772/j.issn.1001-9081.2021020239

Abstract （443）

HTML （11）

PDF （873KB）（98）

Save

In order to solve the problem that most exploration/exploitation strategies of reinforcement learning ignore the risk brought by the agent action selection with random components in exploration process， a Q-table initialization approach based on Factorization Machine （FM） was proposed for safe exploration. Firstly， the explored Q-values were introduced as prior knowledge， and then FM was used to build the model of potential interaction between states and actions in the prior knowledge. Finally， the unknown Q-values in Q-table were predicted based on this model to further guide the exploration of the agents. A/B testing was conducted in the grid reinforcement learning environment Cliffwalk of OpenAI Gym. The number of bad exploration episodes of Boltzmann and Upper Confidence Bound （UCB） exploration/exploitation strategies based on the proposed approach are reduced by 68.12% and 89.98% respectively. Experimental results show that the proposed approach improves the safety of exploration， and accelerates the convergence at the same time.

Table and Figures | Reference | Related Articles | Metrics